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Abstract — This paper studies the transmit strategy for a 
secondary link or the so-called cognitive radio (CR) link under 
opportunistic spectrum sharing with an existing primary radio 
(PR) link. It is assumed that the CR transmitter is equipped with 
multi-antennas, whereby transmit precoding and power control 
can be jointly deployed to balance between avoiding interference 
at the PR terminals and optimizing performance of the CR 
link. This operation is named as cognitive beamforming (CB). 
Unlike prior study on CB that assumes perfect knowledge of the 
channels over which the CR transmitter interferes with the PR 
terminals, this paper proposes a practical CB scheme utilizing a 
new idea of effective interference channel (EIC), which can be 
efficiently estimated at the CR transmitter from its observed 
PR signals. Somehow surprisingly, this paper shows that the 
learning-based CB scheme with the EIC improves the CR channel 
capacity against the conventional scheme even with the exact CR- 
to-PR channel knowledge, when the PR link is equipped with 
multi-antennas but only communicates over a subspace of the 
total available spatial dimensions. Moreover, this paper presents 
algorithms for the CR to estimate the EIC over a finite learning 
time. Due to channel estimation errors, the proposed CB scheme 
causes leakage interference at the PR terminals, which leads to 
an interesting learning-throughput tradeoff phenomenon for the 
CR, pertinent to its time allocation between channel learning 
and data transmission. This paper derives the optimal channel 
learning time to maximize the effective throughput of the CR link, 
subject to the CR transmit power constraint and the interference 
power constraints for the PR terminals. 

Index Terms — Cognitive beamforming, cognitive radio, effec- 
tive interference channel, learning-throughput tradeoff, multi- 
antenna systems, spectrum sharing. 



I. Introduction 

Cognitive radio (CR), since the name was coined by 
Mi tola in his seminal work [1], has drawn intensive 
attentions from both academic and industrial communities. 
Generally speaking, there are three basic operation models for 
CRs, namely. Interweave, Overlay, and Underlay (see, e.g., [2] 
and references therein). The interweave method is also known 
as opportunistic spectrum access (OSA), originally outlined 
in [1] and later introduced by DARPA, where the CR is 
allowed to transmit over the spectrum allocated to an existing 
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primary radio (PR) system only when all PR transmissions are 
detected to be off. In contrast to interweave, the overlay and 
underlay methods allow the CR to transmit concurrently with 
PRs at the same frequency. The overlay method utilizes an 
interesting "cognitive relay" idea [3], [4]. For this method, the 
CR transmitter is assumed to know perfectly all the channels in 
the coexisting PR and CR links, as well as the PR messages 
to be sent. Thereby, the CR transmitter is able to forward 
PR messages to the PR receivers so as to compensate for 
the interference due to its own messages sent concurrently 
to the CR receiver. In comparison with overlay, the underlay 
method requires only the channel gain knowledge from the CR 
transmitter to the PR receivers, whereby the CR is permitted 
to transmit regardless of the on/off status of PR transmissions 
provided that its resulted signal power levels at all PR receivers 
are kept below some predefined threshold, also known as the 
interference-temperature constraint [5], [6]. From implemen- 
tation viewpoints, interweave and underlay methods could be 
more favorable than overlay for practical CR systems. 

In a wireless environment, channels are usually subject 
to space-time-frequency variation (fading) due to multipath 
propagation, mobility, and location-dependent shadowing. As 
such, dynamic resource allocation (DRA) becomes crucial to 
CRs for optimally deploying their transmit strategies, where 
the transmit power, bit-rate, bandwidth, and antenna beam 
are dynamically allocated based upon the channel state in- 
formation (CSI) of the PR and CR systems (see, e.g., [7]- 
[14]). In this paper, we are particularly interested in the 
case where the CR terminal is equipped with multi-antennas 
so that it can deploy joint transmit precoding and power 
control, namely cognitive beamforming (CB), to effectively 
balance between avoiding interference at the PR terminals 
and optimizing performance of the CR link. In [14], various 
CB schemes have been proposed considering the CR transmit 
power constraint and a set of interference power constraints at 
the PR terminals, under the assumption that the CR transmitter 
knows perfectly all the channels over which it interferes with 
PR terminals. In this work, however, we propose a practical 
CB scheme, which does not require any prior knowledge of the 
CR-to-PR channels. Instead, by exploiting the time-division- 
duplex (TDD) operation mode of the PR link and the channel 
reciprocities between the CR and PR terminals, the proposed 
CB scheme utilizes a new idea so-called effective interference 
channel (EIC), which can be efficiently estimated at the 
CR terminal via periodically observing the PR transmissions. 
Thereby, the proposed learning-based CB scheme eliminates 



the overhead for PR terminals to estimate the CR-to-PR 
channels and then feed them back to the CR, and thus makes 
the CB implementable in practical systems. 

Furthermore, the proposed learning-based CB scheme with 
the EIC creates a new operation model for CRs, where the CR 
is able to transmit with PRs at the same time and frequency 
over the detected available spatial dimensions, thus named 
as opportunistic spatial sharing (OSS). On the one hand, 
OSS, like the underlay method, utilizes the spectrum more 
efficiently than the interweave method by allowing the CR to 
transmit concurrently with PRs. On the other hand, OSS can 
further improve the CR transmit spectral efficiency over the 
underlay method by exploiting additional side information on 
PR transmissions, which is extractable from the observed EIC 
(more details will be given later in this paper). Therefore, OSS 
is a more superior operation model for CRs than both underlay 
and interweave methods in terms of the spectrum utilization 
efficiency. 

The main results of this paper constitute two parts, which 
are summarized as follows: 

• First, we consider the ideal case where the CR's estimate 
on the EIC is perfect or noiseless. For this case, we derive 
the conditions under which the EIC is sufficient for the 
proposed CB scheme to cause no adverse effects on the 
concurrent PR transmissions. In addition, we show that 
when the PR link is equipped with multi-antennas but 
only communicates over a subspace of the total available 
spatial dimensions, the learning-based CB scheme with 
the EIC leads to a capacity gain over the conventional 
zero-forcing (ZF) scheme [14] even with the exact CR- 
to-PR channel knowledge, via exploiting side information 
on PR transmit dimensions extracted from the EIC. 

• Second, we consider the practical case with imperfect 
estimation of EIC due to finite learning time. We propose 
a two-phase protocol for CRs to implement learning- 
based CB. The first phase is for the CR to observe 
the PR signals and estimate the EIC, while the second 
phase is for the CR to transmit data with CB designed 
via the estimated EIC. We present two algorithms for 
CRs to estimate the EIC, under different assumptions 
on the availability of the noise power knowledge at 
the CR terminal. Furthermore, due to imperfect channel 
estimation, the proposed CB scheme results in leak- 
age interference at the PR terminals, which leads to 
an interesting learning-throughput tradeoff, i.e., different 
choices of time allocation between CR's channel learning 
and data transmission correspond to different tradeoffs 
between PR transmission protection and CR throughput 
maximization. We formulate the problem to determine 
the optimal time allocation for estimating the EIC to 
maximize the effective throughput of the CR link, subject 
to the CR transmit power constraint and the interference 
power constraints at the PR terminals; and derive the 
solution via applying convex optimization techniques. 

The rest of this paper is organized as follows. Section II 
presents the CR system model. Section III introduces the idea 
of EIC. Section IV studies the CB design based on the EIC un- 
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Fig. 1. Spectrum sharing between a CR link and a PR link. 

der perfect channel learning. Section V considers the case with 
imperfect channel learning, presents algorithms for estimating 
the EIC, and studies the learning-throughput tradeoff for the 
CR link. Section VI presents numerical results to corroborate 
the proposed studies. Finally, Section VII concludes the paper 
Notation: Scalar is denoted by lower-case letter, e.g., x, and 
bold-face lower-case letter is used for vector, e.g., x, and bold- 
face upper-case letter is for matrix, e.g., X. For a matrix S, 
Tr(S'), Rank(S'), l^], S'"\ S\ , and S" denote its trace, 
rank, determinant, inverse, pseudo inverse, transpose, and 
conjugate transpose, respectively. Diag(a;i, . . . ,xjv/) denotes 
a i\jf X M diagonal matrix with diagonal elements given by 
xi, . . . , xm. For a matrix Af , Amax(Af) and A,ni„(Af) denote 
the maximum and minimum eigenvalues of M, respectively. 
/ and denote the identity matrix and the all-zero matrix, 
respectively, with proper dimensions. For a positive semi- 
definite matrix S, denoted hy S >p 0, S^^^ denotes a square- 
root matrix of S, i.e., S^^^{S^^^)^ = S, which is assumed 
to be obtained from the eigenvalue decomposition (EVD) 
of 5': If the EVD of S is expressed as S' VSU" , 
then S^^^ = UT,^^^. \\x\\ denotes the EucUdean norm of a 
complex vector x. C^^ v denotes the space of .t x y matrices 
with complex entries. The distribution of a circular symmetric 
complex Gaussian (CSCG) vector with mean vector x and 
covariance matrix S is denoted by CJ\f{x, S), and ~ stands 
for "distributed as". E[-] denotes the statistical expectation. 
Prob{-} denotes the probability. max(a;,y) and min(a;,y) 
denote the maximum and the minimum of two real numbers, 
X and y, respectively. For a real number a, (a)+ = max(0, a). 

II. System Model 

For the purpose of exposition, in this paper we consider a 
simplified CR system as shown in Fig. 1, where a single CR 
link consisting of one CR transmitter (CR-Tx) and one CR 
receiver (CR-Rx) coexists with a single PR link consisting 
of two terminals denoted by PRi and PR2, respectively. The 
number of antennas equipped at CR-Tx, CR-Rx, PRi, and 
PR2 are denoted by Mt, Mr, Mi, and M2, respectively. It 
is assumed that Mt > 1, while Mr, Mi, and M2 can be 
any positive integers. For the PR link, it is assumed that 
PRi and PR2 operate in a TDD mode over a narrow-band 
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flat-fading channel. Furthermore, reciprocity is assumed for 
the channels between PRi and PR2, i.e., if the channel from 
PRi to PR2 is denoted by F e c^hxMi^ ^^^^ channel 
from PR2 to PRi becomes Without loss of generality 

(W.l.o.g.), the transmit beamforming matrix for PRj, j = 
1,2, is denoted by Aj e £MjXdj^ ^j^jj denoting the 
corresponding number of transmit data streams, 1 < dj < Mj. 
The transmit covariance matrix for PR_, is then defined as 
Sj = AjA^ . We assume that Aj is a full-rank matrix and 
thus Rank(S'_,) = dj. Furthermore, define Bi e C'^^^^'^i as 
the receive beamforming matrix for PRi and B2 G /[^^dixAh 
for PR2. Both Bj's are assumed to be full-rank. It is also 
assumed that PRi and PR2 are both oblivious to the CR, and 
treat the interference from the CR as additional noise. 

The CR is assumed to transmit over the same frequency 
band of the PR, and thus it needs to protect any active PR 
transmissions by limiting its resulted interference power levels 
at both PRj's to be below some prescribed threshold (to be 
specified later in Section V). Let H e £^^rxMt denote the CR 
channel, and Gj G C^'^^ ^ denote the interference channel 
from CR-Tx to PR^, j = 1,2. Let the transmit beamforming 
matrix of CR-Tx be denoted by a fuH-rank matrix Aq-r e 
(j-A/fXdcR^ where dcR < Mt and dcR = Rank(S'cR), with 
5cR denoting the transmit covariance matrix of CR-Tx, i.e., 
5cR — Ac^Aqy{- Note that in Fig. 1, the channels from 
PRj's to CR-Rx are not shown, while similarly as for the PR 
terminals, it is assumed that any interference from PRj 's over 
these channels is treated as additional noise at CR-Rx. 

In [14], various CB designs in terms of the CR transmit 
covariance matrix, 5cr, have been studied for a similar 
system setup like that in Fig. I, where the CR channel transmit 
rate is maximized under the CR transmit power constraint and 
a set of interference power constraints for the PR terminals. 
The CB designs in [14] assume that in Fig. 1, CR-Tx has 
perfect knowledge of H, Gi, and G2- In this work, however, 
we remove the assumption on any prior knowledge of Gi and 
G2 at CR-Tx for the design of CB, since in practice the CR 
and PR systems usually belong to different operators, and it 
is thus difficult to require PRs to estimate CR-to-PR channels 
and then feed them back to the CR. As such, the most feasible 
method for the CR to learn some knowledge of CR-to-PR 
channels is to observe the PR signals propagating through 
PR-to-CR channels and then apply the channel reciprocities 
between CR-Tx and PRj's. Thus, in this paper we propose 
a learning-based CR transmit strategy, where the CR first 
observes the received PR signals to extract CR-to-PR channel 
knowledge, and then designs the CB based upon the obtained 
channel knowledge. However, there are several issues related 
to this approach, which are pointed out as follows: 

• What CR-Tx can possibly estimate are indeed the "ef- 
fective" channels, Gf Aj, from PRj, j = 1, 2, instead of 
the actual channels, Gj 's, if the PR transmit beamforming 
matrices, Aj's, are not known at CR-Tx. 

• The proposed CB schemes in [14] require that the chan- 
nels associated with Gi and G2 be separately estimated 

'The results of this paper hold similarly for the case where F"^ instead of 
is used to represent the reverse channel of F. 



at CR-Tx. As such, CR-Tx needs to synchronize with 
PR TDD transmissions, which requires knowledge of the 
exact time instants over each transmit direction between 
PRi and PR2. 

* If CR-Tx designs ^cr based on the effective channels, 
Gf Aj's, it is unclear whether the effect of its resulted 
interference at PRj's can be properly controlled since the 
signals from CR-Tx interferer with PRj via the equivalent 
channel, BjGj, which can be different from A^Gj if 
the PR receive beamforming matrix Bj differs from Aj . 

Therefore, to make the learning-based CB feasible for prac- 
tical systems, the above issues need to be carefully addressed, 
without critical assumptions or prior knowledge on PR signal 
processing procedures. In this paper, we propose an effective 
solution to resolve the aforementioned issues, while utilizes a 
new idea, namely effective interference channel (EIC), as will 
be presented next. 

III. Effective Interference Channel 

For the learning-based CB scheme, suppose that prior to 
data transmission, CR-Tx first listens to the frequency band 
of interest for PR transmissions over N symbol periods. The 
received baseband signals can then be represented as 



y{n) = GfAjtj{n) + z{n), n = I, 



(1) 



where j = 1 if n e Afi, and j = 2 if n e Af2, with 
A/i, A/2 C {1, . . . , A^} denoting the time instants when PRi 
transmits to PR2 and PR2 transmits to PRi, respectively, and 
Afi n A2 = due to the assumed TDD mode; tj{nys are 
the encoded signals (prior to power control and precoding) 
for the corresponding PRj, and for convenience it is assumed 
that tj(7i)'s are independent over n's and E,[tj{n){tj{n))^] = 
Id-xd-, j = 1,2; 2;(7i)'s are the additive noises assumed 
to be independent CSCG random vectors with zero-mean 
and covariance matrix denoted by polMtxMf Denote the 
cardinality of the set JVj as \JVj\. It is reasonable to assume 
that PRj will transmit, with a constant probability aj < 1, 
during a certain time period. Mathematically, we may use 



E 



N 



N 



a, or 



N 



aj. Note that ai + a2 < 1, 
where a strict inequality occurs when there are guard (silent) 
intervals between alternate PR TDD transmissions. Also note 
that if ai = q;2 = 0, there will be no active PR transmissions 
in the observed frequency band. 

Define Sj{n) as qj{n)tj{n), where qj{n) = 1, if n G 
Afj and qj{n) — otherwise. Obviously, qj(ri)'s are ran- 



dom variables with E[qj{n)] — aj 
q2(n) are related by qi{n)q2{n) 



Meanwhile, qi{n) and 
OjVn. Thus, we have 



E[sj(n)(sj(n))^] = a^I, j = 1,2, but E[si(n)(s2(n))^] = 
0. The signal model in (1) can then be equivalently rewritten 

as 

y{n) = As{n) + z{n), n=l,...,N (2) 

where A = [G'^ Ai^G" A2] and s{n) 
[(si(n))^, (s2(n))^]^. The covariance matrix of the 
received signals at CR-Tx is then defined as 



=E[y(n)(y(n))^] =Q,+po/ 



(3) 
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where 

^aiGfSiGi +a2Gf52G2 (4) 

denotes the covariance matrix due to only the signals from 
PR/s. 

Practically, only the sample covariance matrix can be ob- 
tained at CR-Tx, which is expressed as 

1 ^ 

n=l 

From law of large number (LLN), it is easy to verify that 
Qy pqI with probabiUty one as N oo, while for 

finite values of N, can only be estimated from Qy~ Denote 
Qj, as the estimate of from Q . Note that should be 

a covariance matrix and hence :>= and Qs = Qs- Next, 
we denote the aggregate "effective" channel from both PR/s 
to CR-Tx as 

Geli- = QT (6) 

while under the assumption of channel reciprocity, we denote 
the effective interference channel (EIC) from CR-Tx to both 
PRj's as Goff. 

In the rest of this paper, CB schemes based on the EIC 
instead of the actual CR-to-PR channels will be studied. Note 
that with the EIC, the first two items of implementation issues 
raised in Section II are resolved. The first issue is resolved 
since the EIC is defined over the effective channels from PR/s 
to CR-Tx instead of the actual channels, while the second issue 
is resolved since the EIC does not attempt to separate the two 
channels from PRj's to CR-Tx, and thus synchronization for 
CR-Tx with each transmit direction between PRi and PR2 is 
no longer required. However, we still need to address the third 
issue on analyzing the effect of the CR's interference on the 
PR transmissions with the ElC-based CB design. We will first 
address this issue for the ideal case where the estimation of 
Gcff is perfect or noiseless in Section IV, in order to gain some 
insights into this problem. Then, we will study this problem 
for the more practical case where Gefi is imperfectly estimated 
due to finite values of N in Section V. 

IV. Perfect Channel Learning 

In this section, we design the CR transmit covariance matrix, 
ScR, in terms of the equivalent transmit beamforming matrix, 
Acn, which contains information of both transmit precoding 
and power allocation at CR-Tx, under perfect learning of 
the EIC, i.e., the noise effect on estimating from Qy is 
completely removed. For this case, = in (6), and from 
(4) it follows that the EIC can now be expressed as 

^ UaiGf SiGi + a^G^ S2G2) j . (7) 

From (7), due to independence of the channels Gi and G2, 
it follows that dcs ~ RELnk(Gofi) = min((ii + d2, Mt). Thus, 
if the number of antennas at CR-Tx, Mt, is strictly greater 
than the total number of transmit data streams between PRi 

^Algorithms for such an estimation are discussed in Section V-A. 



and PR2, di + d2, then the EIC -based CB design will have 
at most Mt — {di + ^2) number of spatial dimensions or 
degrees of freedom (DoF) [15] for transmission, where all 
these dimensions he in the null space of Goff. Based on this 
observation, we obtain the following proposition: 

Proposition 4.1: Under perfect learning of the EIC, if 
the conditions AfGj □ BjGjJ = 1,2 hold,"* then the 
ElC-based CB design satisfying the constraint GcsAcr = 
will have no adverse effects on PR transmissions, i.e., 
BjGjAcn^O,:) = 1,2. 

The conditions in the above proposition can also be ex- 
pressed as Span(Aj^Gj) 3 Span(JBjGj), j = 1,2, where 
Span(X) denotes the subspace spanned by the row vectors 
in X. Intuitively speaking, these conditions hold when the 
transmit signal space of PR^ after propagating through the PR- 
to-CR channel G^ , i.e., G^Aj, if being reversed (conjugate 
transposed), will subsume the equivalent channel from CR-Tx 
to PRj, BjGj, as a subspace, for both j = 1,2. Note that 
A^Gj and BjGj may not have the same column size, and 
Aj and Bj may differ from each other for any j = 1, 2. As 
such, the validity of these conditions needs to be examined 
for practical systems. Thus, before we proceed to the proof 
of Proposition 4.1, we present two typical examples of multi- 
antenna transmission schemes for the PR link as follows, for 
both of which the conditions A^ Gj □ Bj Gj,j = 1,2 are 
usually satisfied. 

Example 4.1: Spatial Multiplexing: When the PR CSI is 
unknown at transmitter but known at receiver, the spatial 
multiplexing mode is usually adopted to assign equal powers 
and rates to transmit antennas (e.g., the V-BLAST scheme 
[15]). For this case, the transmit covariance matrix at PR,, 
j = 1,2, reduces to Sj = jj-I MjxMj, with Pj denoting 
the transmit power of PRj. Thus, dj = Mj, and A/s are 
both scaled identity matrices. It then follows that Aj Gj □ 
BjGj,j = 1,2, holds regardless of Bj. 

Example 4.2: Eigenmode Transmission: When the PR CSI 
is known at both transmitter and receiver, which is usually a 
valid assumption for the TDD mode, the eigenmode trans- 
mission mode is usually adopted to decompose the multi- 
antenna PR channel into parallel scalar channels [15]. For 
this case. Si and S2 are designed based on the singular-value 
decomposition (SVD) of F and , respectively. Let the SVD 
of F be Uf'S^fV". It then follows that Ai = Vf{i)K^^^ 
Bi = yf(2), A2 = C/p(2)A2^^, and B2 = C/f(i), where 
Aj = Diag(Aj,i, . . . , Aj d ) is a positive diagonal matrix with 
Aj_i,i = l,...,o?j, denoting the power allocation over the 
ith transmit data stream, and Vpij) (^F{j)) consists of the 
first dj columns in Vp (Up)- Note that in this case dj < 
min(Mi,M2). If it is true that di = d2, i.e., both transmit 
directions between PRi and PR2 have the same number of 
data streams, then it follows that A^Gj □ BjGj holds 
for both j = 1,2. Note that a valid special case here is the 
"beamforming mode" [15] with di = d2 = 1. 

■^X Zl Y means that for two given matrices with the same collum size, 
X and Y, if Xe = for any arbitrary vector e, then Ye = must hold. 

"^Note that when the conditions in Proposition 4.1 are not satisfied, the 
proposed CB scheme will cause certain performance loss to PR transmissions 
even under perfect channel learning. 
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Next, we present the proof of Proposition 4.1 as follows: 
Proof: First, with perfectly known Gcs, Gcs 3 -^fGj 
is true for j = 1,2. This can be shown as follows given 

any arbitrary vector e: Gdie = ^ [qJ = ^ 
{Qy^)"e = e"Q,e = ^ ||AfG,ef = 0,j = 
1,2 ^ AfGjB = 0,j = 1,2, where (a) is from (6), (b) 
is due to = Q^, and (c) is from (4). Since for arbitrary 
matrices X,Y, and Z, X □ Y and Y ^ Z imply that 
X □ Z, from Goff □ AfGj (shown above) and AfG-j □ 
BjGj (given in Proposition 4.1) it follows that Ges 3 -BjGj, 
j = 1, 2. Therefore, if the constraint GeS^CR = is satisfied, 
it follows that BjGjAcn ~0,j = 1,2, i.e., the interference 
from CR-Tx at PRj lies in the null space of the corresponding 
receiver beamforming matrix Bj, and thus has no adverse 
effects. ■ 
From Proposition 4. 1, it is known that if the given conditions 
are satisfied, it is sufficient for us to design Acr subject to 
the constraint Goff Acr = 0, in order to remove the effects 
of the CR signals on PR transmissions. Let the EVD of 
be represented as Q,. = VSV", where V G C^txci^tt 
and S is a positive dcff x doff diagonal matrix. Note that 
Rank((53) = dcff. From (6), G^ can then be represented 
as G^ff ~ VE^^^. Define the projection matrix related to V 
as Pv ^ I - VV" = UU", where U G <cMty.(Mt-d,n) 
satisfies V"U = 0. We are now ready to present the general 
form of AcR satisfying the constraint Gcff Acr = as [16] 

AcR = UC]!l (8) 

where G^r G c(A/t-d<,ff)xdcR ^jt^ ^^^^ denoting the 
number of transmit data streams of the CR, and Gcr G 
C(A/t-d<,tf)x(Mt-defi) satisfies that Gcr ^ and Tr(GcR) = 
Tr(5'cR) < PcK^ with PcR denoting the transmit power 
constraint of CR-Tx. From (8), it follows that designing the 
transmit beamforming matrix Acr for the CR channel be- 
comes equivalent to designing the transmit covariance matrix 
Gcr for an auxiliary multi-antenna channel, HU, subject to 
transmit power constraint, Tr(GcR ) < ^CR - This observation 
simplifies the design for the remaining part in ^cRj i-S-j Cqk, 
since existing solutions (see, e.g., [15] and references therein) 
are available for this well-studied precoder design problem. 

At last, we demonstrate an interesting property for the 
proposed CB scheme given in (8), when the conditions given 
in Proposition 4.1 are satisfied, and furthermore, PRi and/or 
PR2 have multi-antennas but transmit only over a subspace of 
the available spatial dimensions, i.e., dj < mm{Mi, M2), j = 
1, 2. For this case, we will show that the proposed scheme in 
(8) with the EIC Gcff can be superior over the conventional 
"projected-channel SVD (P-SVD)" scheme proposed in [14] 
with the actual CR-to-PR channels Gi and G2, in terms of 
the achievable DoF for CR transmission. At a first glance, this 
result is counter-intuitive since Gcff contains only partial infor- 
mation on Gj's. The key observation here is that Gcff contains 
information on A^ Gj's, which also exhibit side information 
on BjGfs via the conditions, A^Gj □ BjGj,j = 1,2, 
given in Proposition 4.1, while BjGj's are assumed to be 
unknown for the P-SVD scheme. More specifically, for the 
proposed scheme, the DoF is given as den, which can 
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Fig. 2. CR capacity comparison for the proposed CB scheme and the P-SVD 
scheme in [14]. 

be shown to be upper-bounded by min(i\/t — doff, Mr) = 
min((A/t — di — ^2)^, Mr). In comparison with the proposed 
scheme, the P-SVD scheme with perfect knowledge of Gi 
and G2 removes the interference (thus having no effects on 
PR transmissions like the proposed scheme) at both PRi and 
PR2 via transmitting only over the subspace of H that is 
orthogonal to both Gi and G2, thus resulting in the DoF 
to be at most min ((Mt - Mi - M2)+,A/r). Therefore, the 
proposed scheme can have a strictly positive DoF even when 
Ml + M2 > Mt, provided that di + d2 < Mt, i.e., the total 
number of antennas of PR/s is no smaller than Mt, while 
the total number of data streams over both transmit directions 
between PRi and PR2 is smaller than Mt, while the P-SVD 
scheme has a zero DoF in this case since Mt < Mi + M2. In 
most practical cases, we have dj < min(Mi,M2),j = 1,2. 
It thus follows that (di + ^2) < [Mi + M2) and thus the 
DoF gain of the proposed scheme against the P-SVD scheme, 
min((A/t - di - ^2)+ , Mr) - min((Mt - Mi - M2)+, M^), is 
always non-negative, while the maximum DoF gain occurs 
when di = d2 = 0, i.e., when the PR link switches off 
transmissions. Note that this DoF gain is achieved by the CR 
via exploiting side information on PR transmit (on/off) status 
or signal dimensions extracted from the observed EIC. This 
also justifies our previous claim that the OSS operation mode 
with learning-based CB is potentially more spectral efficient 
than the conventional interweave and underlay methods. 

Example 4.3: The capacity gain of the proposed scheme 
in (8) over the P-SVD scheme in [14], as above discussed, 
is shown in Fig. 2 for a PR link with Mi = M2 = 2, 
di ~ d2 ~ 1 (i.e., beamforming mode corresponding to the 
largest channel singular value in Example 4.2 is used), and a 
CR link with Mt = 5 and Mr = 3. All the channels involved 
are assumed to have the standard Rayleigh-fading distribution, 
i.e., each element of the channel matrix is independent CSCG 
random variable ~ CJ\f{0, 1). For simplicity, it is assumed that 
the interference due to PR transmissions at CR-Rx is included 
in the additive noise, which is assumed to be CAf{0, pil). 
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Fig. 3. Two-phase protocol for CR block-based transmission. 



The signal-to-noise ratio (SNR) in this case is thus defined 
as Pck/ Pi- The DoF can be visually seen in the figure to 
be proportional to the asymptotic ratio between the capacity 
value over the log-SNR value as SNR goes to infinity [15]. 
It is observed that the DoF for the proposed scheme is 
approximately three times of that for the P-SVD scheme in 
this case, since min((Mt — rfi — ^2)^ , Mr)/ min((Mt — Mi — 
M2)+,M,) = 3/1 = 3. 

V. Imperfect Channel Learning 

In the previous section, CB designs have been studied under 
the assumption that the EIC Gcfr is perfectly estimated at CR- 
Tx. In this section, we will study the effect of imperfect esti- 
mation of Goft due to finite sample size N on the performance 
of the proposed CB scheme. First, consider the following 
two-phase protocol for CRs to support the learning-based CB 
scheme as shown in Fig. 3, where each block transmission 
of the CR with duration T is divided into two consecutive 
sub-blocks. During the first sub-block of duration r, the CR 
observes the PR transmissions and estimates Goft; during the 
second sub-block of duration T — r, the CR transmits data 
with the CB design based on the estimated Gcfr (following 
the same procedure as in Section IV, but with the estimated 
GofT instead of its true value). Note that T needs to be chosen 
such that, on the one hand, to be sufficiently small compared 
with the channel coherence time in order to maintain constant 
channels during each transmission block, and on the other 
hand, to be as large as possible compared with the inverse 
of the channel bandwidth in order to make T constitute a 
large number of symbols to reduce the overhead of channel 
learning. In this paper, we assume that T is preselected as 
a fixed value. For a given T, intuitively, a larger value of 
T is desirable from the perspective of estimating Gofr, while 
a smaller t is favorable in terms of the effective CR link 
throughput that is proportional to (T — t) /T. Therefore, there 
exists a general learning-throughput tradeoff for the proposed 
CB scheme,^ where different choices for the value r lead to 
different tradeoffs between PR transmission protection and CR 
throughput maximization. 

The rest of this section is organized as follows. We first 
present two practical algorithms for CRs to estimate Gofr with 
a finite in Section V-A. Then, in Section V-B we analyze the 
so-called "effective leakage interference" at PR terminals for 
the proposed CB scheme due to estimation errors in Gofr. At 
last, in Section V-C we characterize the learning-throughput 
tradeoff for CRs by finding the optimal value of r to maximize 

'Note that the learning-throughput tradeoff includes the sensing-throughput 
tradeoff studied in [17] as a special case since CR channel sensing to detect 
PR's on/off status [17] can be considered as a "hard" version of CR channel 
learning proposed in this paper. 



the CR link effective throughput, under the given T, Pcr, and 
maximum tolerable leakage interference power level at PRj 's. 

A. Estimation of Goff 

From (6), it is known that Goff depends solely on Q,,, which 
is the estimate of the received PR signal covariance matrix 
Qs defined in (4). Thus, in this subsection, we present two 
algorithms to obtain Q ^ from the received sample covariance 
matrix given in (5). Denote the EVD of as 



TyAyTy 



(9) 



where Ay = Diag(Ai, A2, . . . , Aj\/j) is a positive diagonal 
matrix whose diagonal elements are the eigenvalues of Qy. 
W.l.o.g., we assume that A/s, i = 1, . . . , Mt, are arranged in 
a decreasing order We then obtain from Qy based on the 
standard maximum likelihood (ML) criterion, for the following 
two cases: 

1) Known noise power pq: In the case where the noise 
power, Pq, is assumed to be known at CR-Tx prior to channel 
learning, it follows from [18] that the ML estimate of is 
obtained as 

H 



= TyDiag (Ai - po) 



. (Aa/, - Po)+ T 



(10) 



The rank of Q^, or the estimate of dcs, denoted as d^s, can 
be found as the largest integer such that Aj > po- Therefore, 
the first deft columns of Ty give the estimate of V, denoted 
by V, and the last Mt — doff columns of Ty are deemed as 
the estimate of U, denoted by U. Note that U will replace 
the true value of U in (8) for the proposed CB design in the 
case with imperfect channel learning. 

2) Unknown noise power po: In this case, po is unknown 
to CR-Tx and has to be estimated along with Q,. The ML 
estimate of po can first be obtained as [19] 



1 



Po 



Mt - doff 



Mt 

= rfcff + l 



(11) 



where doft is the ML estimate of doff. Specifically, dotr can be 
obtained as [19] 

, ^;i/(A/t-fe)^ 

doft = arg max {Mt — k)N log 



1 sr^Mt \ 



: arg max 



(12) 



where GM(fc) and AM(fc) denote the geometric mean and 
the arithmetic mean of the last Mt — k eigenvalues of Qy, 
respectively. To make this estimation unbiased, we convention- 
ally adopt the so-called minimum description length (MDL) 
estimator expressed as [19] 

doft = arg mm {Mt-k)N log (^^) +\k{2Mt-k) logiV 

(13) 

where the second term on the right-hand side (RHS) is a bias 
correction term. The ML estimates of V and U, denoted by 
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V and U, are then obtained from the first doff and the last 
Mt — dcs columns of Ty, respectively. 

After knowing po, d^s, V, and t/, the ML estimate of 
is obtained as 

= VDiag (Ai - PO, . . . , A^;^,^ - po) v" ■ (14) 

From (10) and (14), it is observed that the above two 
estimators have a similar structure while they differ in the 
noise power term adopted and the way to estimate the rank of 

B. Effective Leakage Interference 

Due to imperfect channel estimation, the CB design in (8) 
with U replaced by XJ cannot perfectly remove the effective 
interference at PRj's. In this subsection, the effect of channel 
estimation errors on the resultant leakage interference power 
levels at PRj's will be analytically quantified so as to assist 
the later studies. Define the rank over-estimation probability 
as Po{k) = Prob((icff - defT = A;|deff), fc = 1, . . . ,defT, and 
the rank under-estimation probability as Pu{k) = Prob((icff — 
des = k\dos), k = 1, . . . , Mt — doff, both conditioned on the 
observation deS- If the over-estimation of dcS is encountered, 
the upper bound on the number of data streams from CR-Tx, 
dcR, may be affected. However, as long as {Mt — doff) > Mr, 
dcR is more tightly bounded by Mr and the over-estimation 
of doff does not cause any problem. On the other hand, the 
under-estimation of doff will bring a severe issue, since some 
columns in C/ may actually come from the PR signal subspace 
spanned by V. In this case, the amount of interference at 
PRs will be tremendously increased, which is similar to the 
scenario in the conventional interleave-based CR system when 
a misdetection of active PR transmissions occurs to the CR. 
In practice, a threshold ^ should be properly set, and the last 
Mt — (doff + fco) columns in Ty are chosen as tj only if 

Detailed study on po{k), Pu{k), and ^ is deemed as a 
separate topic of this paper and will not be further addressed 
here. In this paper, for simplicity we will assume that the rank 
of or doff is correctly estimated. We will then focus on 
studying the effect of finite N on the distortion of the estimated 
eigenspace U. From (8), the transmit signal at CR-Tx in the 
case of imperfect channel learning is expressed as 

ScR(n) = AcntcRin) = UC'J^tcRin), n>N (15) 

where scR(n) is the precoded version of the data vec- 
tor tcR{n). Note that E[tcR{n){tcRin))"] = I and 
]E[scr('t-)(scr("-))^] = ScR. The average leakage interfer- 
ence power at PRj,j = 1,2, due to the CR transmission is 
then expressed as 

I, =E[\\B,GjScBin)f]. (16) 

Next, Ij is normalized by the corresponding processed (via 
Bj) noise power to unify the discussions for PR^'s. For 
convenience, it is assumed that the additive noise power at 
PRj is equal to po, the same as that at CR-Tx, and thus the 



processed noise power becomes po'Tr{BjBj ). Define 

Ij = TT- (17) 

' PoMB.Bf) 

Ij is then named as the "effective leakage interference power" 
at PRj since it measures the power of interference normalized 
by that of noise after they are both processed by the receive 
beamforming matrix, Bj. 

Lemma 5.1: The upper bounds on Ij, j = 1,2, are given 

as 

- ^ Tt{Ccr) Xma.x{GjGf) 

ajN A,„i„(AfG,Gf A,)' 
Proof: Please refer to Appendix I. ■ 
From Lemma 5.1, it follows that the upper bound on Ij is 
proportional to the CR transmit power Tr (Cci?), but inversely 
proportional to aj, N, and the PRj's average transmit power 
Pj (via Aj). Some nice properties of the derived effective 
leakage interference powers associated with the proposed CB 
scheme are listed as follows: 

• Ij is upper-bounded by a finite value provided that aj > 
0.^ Note that Xn,in{Af G^Gf Aj) > if Mt > dj and 
in this case A^Gj is a full-rank and fat matrix. 

> Ij can be easily shown to be invariant to any scalar mul- 
tiplication over Gj . Thus, the CR protects PRj 's equally 
regardless of their location-dependent signal attenuation. 

* Since for fixed N and Pgr the upper bound on Ij is 
inversely proportional to aj and Pj, PRj gets better 
protected if it transmits more frequently and/or with more 
power. This property is useful for the CR to design /a/r 
rules for distributing its leakage interference among the 
coexisting PRs. 

Example 5.1: In Figs. 4 (a) and 4 (b), numerical results on 
/j's given in (17) as well as theoretical results on the upper 
bounds on I j's given in (18) are compared for PR SNR being 
15dB and OdB, respectively. Note that Pi = P2 = P in 
this example and PR SNR is defined as P/pa, where po is 
assumed to be known at CR-Tx. For the PR, it is assumed 
that Ml = M2 = 1, ai 0.3, and 02 = 0.6, while for 
the CR, Mt = 4, Pqr, = 100, and Ccr is designed for the 
eigenmode transmission. 2,000 random channel realizations 
are used for averaging while the standard Rayleigh fading 
channel distribution is adopted. To clearly see the effect of 
N, we take the inverses of I j's or their upper bounds for the 
vertical axis of each figure. It is observed that at high-SNR 
region, the theoretical and numerical results match well, and 
the interference powers are inversely linearly proportional to 
N. However, at low-SNR region, there exists big mismatch 
between the two results. This is reasonable since the first order 
approximation of (30) in Appendix I is inaccurate at low- 
SNR region. Nonetheless, the good news is that the inverse of 
interference power is observed to be still linearly proportional 
to N from the numerical results. 

''Note that the derived upper bound on Ij is practically meaningful when aj 
is a non-negligible positive number, since in the extreme case of aj = 0, PRj 
switches off its transmissions and as a result discussion for the interference 
at PR, becomes in'elevant. 
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Fig. 4. Leakage interference power levels at PRi and PR2 for different PR 
SNRs. 



C. Optimal Learning Time 

At last, we study the learning-throughput tradeoff for CRs 
by determining the optimal learning time r for a given T to 
maximize the CR link throughput, subject to both transmit 
power constraint of the CR and effective leakage interference 
power constraints at the two PR terminals. It is assumed that 
the CR channel H is known at both CR-Tx and CR-Rx. From 
(8) with U replaced by C/, the maximum CR link effective 
throughput, assuming the receiver noise ~ CJ\f{0, pil), is 
expressed as [22] 



T 



T 



log 



I + HUCcrU"h"/pi 



(19) 



where the term {T — t)/T accounts for the portion of through- 
put loss due to channel learning. 

If the peak transmit power constraint for the CR is adopted, 
we have Tr (Cci?) < PcR.^ while if the average transmit 
power constraint is adopted, we may allocate the total power 
for each block to the second phase transmission, resulting in 
Tr(Cc_R) < j^PcR- Let F denote the prescribed constraint 
on the maximum effective leakage interference powers, Ij's 



defined in (17). Note that N is related to r by iV = t/T^, 
where Tg denotes the symbol period. From Lemma 5.1, it 
follows that it is sufficient for Ccr to satisfy the following 
inequality to ensure the given interference power constraints: 



where 



7i 



Tr (Cci?) < 7jT, j = l,2 

^5 Amax(GjG:^) 



(20) 



(21) 



and Cj, Q < 1, is an additional margin that accounts for 
any analytical errors (e.g., approximations made at low-SNR 
region in Example 5.1). In practice, the choice of 7j's in (21) 
depends on the calibration process at CR-Tx, based on prior 
knowledge of Q's, F, and Tg, as well as the observed average 
signal power from PRs. 

Let 7 = min(7i,72). Then, the interference power con- 
straints in (20) become equivalent to Tr (Cci?) < Jt. The 
problem for maximizing the CR effective throughput is thus 
expressed as 



(PI) 



T-T 



max 

T,CcR 



log 



H _ 



I + HUCcrU" H"/pi 
< J, Ccr >0, 0<t<T 



T 

s.t. Tr(Cci?) 
where J = mhi{PcR, 7t) for the case of peak transmit power 
constraint, while J = min {^7^--pPcR-, 7^^ for the case of 
average transmit power constraint. 

For problem (PI), it is noted that TJ is related to r, which 
makes the maximization over r difficult. However, it can be 
verified that the matrix norm of /SU = TJ — U decreases 
in the order of 0{1/^), as compared to the norm of U. 
Therefore, the overall term U ^ U + AU in the objective 
function is dominated by U, and changes slowly with r when 
T is sufficiently large. Thus, we assume that the effect of r on 
U is ignored in the subsequent analysis, and will verify this 
assumption via simulation results in Section VI. 
Let the EVD of H" HU be Uh'^hU", where Uh 
4ff) X {Mt 



is a (Mt 

Diag((T2i,...,(T^j^^^_^ 



c?off) unitary matrix and S/j = 
^^). W.l.o.g., we assume that cr^j's 



are arranged in a descending order Note that if (Mt — doff) > 
Mr, then ah/s, i = Mr + 1, . . . ,Mt — doff, all have zero 
values. Define X as CqkU h- Problem (PI) is then 
converted to 

^"^log|J + XSJpi| 



(P2) 



max 



T 

s.t. Tr(X) < J, 

where the optimal Ccr can be later recovered as UhXUf^ . 
By the standard approach like in [22, Chapter 10.5], it can 
be shown that the optimal X is a diagonal matrix X = 



Diag(a:i, . . .,XMt- 
obtained from 



d^ft) and Xi's, i = 1, . . . , Mt - doff, are 



(P3) 



max 



s.t. 



T-T 

T 

E 



Mt-d^ 



J2 log 



1=1 



< J, 



pi 



> 0, < T < r. 
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Next, we will study (P3) for the cases of peak and average 
transmit power constraints, respectively. 

1 ) Peak transmit power constraint: In this case, if Pcr > 
7T, then J is always equal to jt. Therefore, we consider the 
more general case with Pcr < 7T. The remaining discussion 
will then be divided into the following two parts for PckIi < 
T < T and < t < PckIi^ respectively. 

If Pgr/ J < T < T, then J = Pcr ™d the optimization 
in problem (P3) over r and z^'s can be separated. The 
optimization over x^'s directly follows the conventional water- 
filling (WF) algorithm [22]. For the ease of later discussion, 
we define 



max 



s.t. 



Mt -do 

E 

1=1 

E 



log 1 



Xi < z, 



Pi 



> 0. 



(22) 



The WF solution of the above optimization problem is then 
given as x, ^ (— ^)^, where — is the water level that 



should satisfy 



Denote qk 



_ fcpi 
75 

Obviously, = 0, and qM, 





Pi 






A; 

2-^i=l 


Pi 

a 'i ■ ' 

h , I 



(23) 



for k = 0, ... ,Mt- doff ■ 



-00 smce a 



set to be zero. Then, we can express f{z) as 



/i,A/t-deff + l 




z e [qk-i,qk] 



(24) 

Note that fc is the number of dimensions assigned with positive 
.Ti's. The objective function of problem (P3) in this case can 
then be explicitly written as 

T-T 



T 



-/(^cr). 



(25) 



Since -^^^ is a decreasing function of t, the optimal t to 
maximize 51 (r) over Pcr/t < t < T is simply PcrJi- 

Next, consider < r < Pck/i- In this case, J = 7r, and 
problem (P3) becomes 

32 (r) ^ ^/(7t). (26) 



max 

0<t<Pcr/7 



In order to study the function 32 (''■), some properties of the 
function f{z) are given below. 

Lemma 5.2: f{z) is a continuous, increasing, differentiable, 
and concave function of z. 

Proof: Please refer to Appendix II. ■ 
With Lemma 5.2, it can be easily verified that (72 (t) is also 
a continuous, differentiable, and concave function of r. Thus, 
the optimal value of r, denoted as r|, to maximize 32 (t) can 
be easily obtained via, e.g., the Newton method [23]. 

To summarize the above two cases, the optimal solution of r 
for problem (P3) in the case of peak transmit power constraint 
can be obtained as 



r^, rl < Pcr/7 

Pcr/Ii otherwise. 



(27) 




(b) r* = Pch/7 < T-2* 

Fig. 5. Illustration of the optimal learning time r* for the case of peak CR 
transmit-power constraint. 



The above solution is illustrated in Fig. 5. The optimal value 
of (P3) then becomes 52 (tj) if < Pck/i, and gi{PcRh) 
otherwise. 

2} Average transmit power constraint: In this case, J in 
problem (P3) takes the value of T/(T - t)Pcr if T/{T - 
t)Pcr < IT, and 7r otherwise. It can be verified that T/ (T— 
t)Pcr < JT for some r in [0, T) only if Pcr/t < T/4. In 
other words, if Pcr/7 > T/A, J always takes the value 7r 
regardless of r. Thus, the objective function of (P3) is always 
given as .92 (t), and the optimal solution of r is Tj. 

Therefore, we consider next the more general case of 
Pcr /7 < r/4. For this case, it can be shown that the equation 
T/(r — t)Pcr = "fT always has two positive roots of r, 
denoted as r; and r^, respectively, and < t/ < < T. 
If < T < Ti or Tu < T < T, J takes the value of 7T, 
and then the maximum value of (P3) is obtained by the r 
that maximizes 172 (t) over this interval of r. Otherwise, the 
maximum value occurs when r is given as 

T 



arg max gz{T) = —^f . 

T,Ti<T<Tu 1 \ 1 



-Pgr 



(28) 



It can be shown that 53 (r) is a continuously decreasing 
function of t, for r e [0,T). Thus, the optimal value of r 
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(a) r* = t| < Ti 




(b) T* 



n < T2 



Fig. 6. Illustration of the optimal learning time t* for tire case of average 
CR transmit-power constraint. 
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Fig. 7. CR throughput versus CR learning time. 




to maximize 53 (r) over this interval of t is simply r;. 

To summarize the above discussions, we obtain the optimal 
solution of T for problem (P3) in the case of average transmit 
power constraint as 

T* = ( "'2' ""2 (29) 

[ Tj, otherwise. 

The above solution is illustrated in Fig. 6. The optimal value 
of (P3) then becomes .92 (tI) if T2 < ti, and .93 (tj) otherwise. 

VI. Numerical Results 

In the section, we present additional simulation results to 
demonstrate the performance of the proposed CB scheme 
under imperfect channel learning. The system parameters are 
taken as Mt — 6, Mr ~ 3, Mi = 4, and M2 = 2. Eigenmode 
transmission is considered for the PR with di = d2 = 2, 
and the PR SNR is set as 20dB. The channels F, d, G2, 
and H are randomly generated from the standard Rayleigh 
fading distribution, and are then fixed in all the examples. 
The parameters r and T are normaUzed by the symbol period 
Tg. After the normalization, T is set as 1, 000 and the lowest 
value of T is set as 10. The CR transmit rate is measured 



Fig. 8. Maximum CR throughput versus CR SNR. 



in nats/complex dimension (dim.). The peak transmit-power 
constraint for the CR is assumed. 

We first fix Pen at CR-Tx as 100 and show the variations 
of the CR throughput as a function of t. Both theoretical 
results obtained in Section V-C where U is not considered 
as a function of r and is replaced by the true value U, and 
numerical results where t/ is obtained via the estimator given 
in Section V-A with known noise power po, are shown in Fig. 
7. The values of 7 are taken as 0.2 and 0.6, respectively. From 
Fig. 7, the first observation is that the numerical and theoretical 
results almost merge with each other, which supports our 
previous assumption of ignoring [/ to be a function of r 
during the optimization process. We also observe that the CR 
throughput for 7 = 0.2 and that for 7 = 0.6 start to merge 
when T is sufficiently large due to the fact that 51 (r) defined 
in (25) does not change with 7. However, the maximum CR 
throughput is observed to increase with 7 because when the 
PRs can tolerate more leakage interference powers, the optimal 
learning time is reduced and the CR transmit power becomes 
less restricted, which leads to an increased CR throughput. 
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Fig. 9. Optimal CR learning time versus CR SNR. 



We then display the maximum CR throughput versus Pgr, 
or equivalently, the CR SNR, in Fig. 8 for different values 
of 7- Only the theoretical results are shown here. The first 
observation is that there exist thresholds on CR SNR, beyond 
which the maximum CR throughput cannot be improved for 
a given 7. This is because that when Pen is too large, the 
dominant constraint for CR throughput maximization becomes 
the interference-power constraint instead of transmit-power 
constraint. When this occurs, the intersection point Pcr/7 
in Fig. 5 moves towards T. Thus, the optimal value of r and 
the corresponding maximum CR throughput are determined 
from g2{T) in (26), which is not related to Pqr- Meanwhile, 
when 7 increases, the maximum CR throughput also increases, 
similarly like the case in Fig. 7. 

At last, we show the change of the optimal t with respect 
to PcR 01" the CR SNR in Fig. 9, where only the theoretical 
results are shown. From Fig. 5, we know that when Pgr 
decreases, the intersection point moves towards zero. Thus, 
the curves of the optimal learning time for different 7's all 
merge to the presumed minimum value for r, r = 10, at 
low-SNR region. On the other side, the optimal values of t 
stop increasing at high-SNR region for a given 7, similarly as 
explained for Fig. 8. Moreover, the optimal t is observed to 
increase with the decreasing of 7. 

VII. Concluding Remarks 

Cognitive beamforming (CB) is a promising technology to 
enable high-rate CR transmissions and yet to provide effective 
interference avoidance at the coexisting PR terminals. The 
main challenge for realizing CB in practical systems is how 
to obtain the channel knowledge from CR transmitter to PR 
terminals. In this paper, we propose a new solution to this 
problem utilizing the idea of effective interference channel 
(EIC), which can be efficiently learned at CR transmitter via 
blind/semiblind estimation over the received PR signals. Based 
on the EIC, we design a practical CB scheme to minimize the 
effect of the resulted interference on the PR transmissions. 
Furthermore, we show that with finite sample size for channel 



learning, there exists an optimal learning time to maximize the 
CR link throughput. 

The developed results in this paper can be readily extended 
to the case with multiple PR links. This is so because the 
proposed CB scheme is based on the EIC that measures the 
space spanned by all the coexisting PR signals as a whole, 
and thus it works regardless of these PR signals coming from 
a single PR link or multiple PR links. 

Appendix I 
Proof of Lejvima 5.1 

Define S = [s(l), . . . , s{N)] and Ys = AS where s(n)'s 
and A. are given in Section III. From [20, Appendix I], 
we know that the first order perturbation' to U due to the 
finite number of samples N and the additive noise Z = 
[z{l), . . . , z{N)] can be approximated by 

AU = U -U ^ -iY^)^Z"U. (30) 

Since the discussions on Ii and I2 are similar, in the 
following we restrict our study on Ii. From the conditions 
given in Proposition 4.1, we know that there exists a constant 
matrix Wi G C''^'"^^ such that Bid = WiA"Gi. The 
average interference power, Ii defined in (16), is then re- 
expressed as 



h *=^E[Tr(BiGiC/CcHC/"GfBf)] 
^^E[Tr{BiGiAUCcRAU"G^B^)] 
^^E[Tr{BiGi{Y^)^ Z^UCcrU" ZYIg^ B^)] 



H 



id) 



PoTr(Cci^)E[Tr(BiGi(yf)tl-tGfBf))] 



PoTr{CcR)E 



Tr(WiAf Gi(A 



if) 



iSS")-^A^G^AiW^) 

PoTr(Cci?)Tr (wi[I,0] 

Pq 

' aiN 







H 



Tr(GcfljTr(WiVFf) 



(31) 



where (a) is via substituting (15) into (16) and using the 
independence of XJ and fcR("^); (&) is due to BiGiU ~ 0; 
(c) is due to (30); (d) is due to independence of Yg and Z and 
E[Z"XZ] = poTr(X)J for any constant matrix X; (e) is 
due to the definitions of Wi and 1^^; and (/) is approximately 
true since N is usually a large number. 
From [21], we have 



Tr(W"iAfGiGf AiPFf ) 
>A„,in(AfGiGf Ai)Tr(W^iW^f) 



(32) 



Tr(BiGiGf ) < A,nax(GiGf )Tr(BiBf ). (33) 
WiA^Gi, from (32) and (33) it follows 

An.ax(GiGf)Tr(BiBf) 



By noting BiGi 
that 



Tr(WiVFf ) < 



Amin(^l^GiGf Ai 



H 



(34) 



^Note that the first order approximation is more valid at high-SNR region. 
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Using (17), (31), and (34), the upper bound on Ii given in 
(18) is obtained. 

Appendix II 
Proof of Lemma 5.2 

First, it is easily known that f{z) is an increasing function 
of z. Next, we prove the continuity, differentiabihty, and 
concavity of f{z), respectively. 



A. Continuity 

From (24), it is known that in each section [qk~i, qk], f{z) 
is obviously continuous. For boundary points of each section, 
we have 



lim_ /(z) = ^log 



i=l 



'h,k+l 



k = 1, . . . , Mt — doff — 1- Thus, f{z) is continuous at all the 
points. 

B. Differentiability 

From (24), it is known that in each section [qk-i, qk], f{z) 
is differentiable. For boundary points of each section, it can 
be verified that 



h,k+l 



lim /(z) 

z->q- Pi 



lim /(z), 



(36) 



k = 1, . . . , Mt — doff — 1- Therefore, /(z) is differentiable at 
all the points. 

C. Concavity 

For a given z, /(z) is obtained by solving the optimization 
problem in (22), which can be easily verified to be a convex 
optimization problem [23]. Thus, the duality gap for this 
optimization problem is zero and /(z) can be equivalently 
obtained as the optimal value of the following min-max 
optimization problem: 



f(z) = minmax > loefl 



Pi 



P-(^x,-z) (37) 



mm 



n5:(log(J^))+-^(l-^)++/.z (38) 
^ pi^ ^ at , 



PlM^ 

2 

h,' 



PlM 



'h,i 



(39) 



where the summations are taken over i = 1, . . . , Mt — d^s, 
and /x^^^ > is the optimal dual variable for a given z. In 
fact, it can be shown that 1/ pS^^ is just the water level given 
in (23) corresponding to the total power z. 

Denote uj as any constant in [0, 1]. Let p''^^\ fi'^^^K and /i^^^^ 
be the optimal p, for /(zi), /(Z2), and /(Z3), Z3 ~ uizi + (1 — 



uj)z2, respectively. For j = 1, 2, we have 



plp 



h,i 



(40) 



(41) 



where the inequality is due to the fact that /^j^^-'' is not the 
optimal dual solution for j ~ 1,2. Therefore, 



u;/(zi) + (1 - c^)/(z2) 



/(WZI + (1 - W)Z2). 

Thus, /(z) is a concave function [23]. 
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